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We present a formalism for performance forecasting and optimization of future cosmic microwave 
background (CMB) experiments. We implement it in the context of nearly full sky, multifrequency, 
5-mode polarization observations, incorporating statistical uncertainties due to the CMB sky 
statistics, instrumental noise, as well as the presence of the foreground signals. We model the 
effects of a subtraction of these using a parametric maximum likelihood technique and optimize 
the instrumental configuration with predefined or arbitrary observational frequency channels, 
constraining either a total number of detectors or a focal plane area. We showcase the proposed 
formalism by applying it to two cases of experimental setups based on the CMBpol and COrE 
mission concepts looked at as dedicated 5-mode experiments. We find that, if the models of 
the foregrounds available at the time of the optimization are sufficiently precise, the procedure 
can help to either improve the potential scientific outcome of the experiment by a factor of a 
few, while allowing one to avoid excessive hardware complexity, or simplify the instrument design 
without compromising its science goals. However, our analysis also shows that even if the available 
foreground models are not considered to be sufficiently reliable, the proposed procedure can guide 
a design of more robust experimental setups. While better suited to cope with a plausible, greater 
complexity of the foregrounds than that foreseen by the models, these setups could ensure science 
results close to the best achievable, should the models be found to be correct. 



I. INTRODUCTION 

Cosmic Microwave Background (CMB) 5-mode polar- 
ization offers some of the most exciting goals for the next 
stage of observational and experimental effort in cosmol- 
ogy. These goals are already aimed at by an entire slew 
of current, forthcoming, and planned CMB observations, 
e.g., [1-5]. Probably most importantly, CMB 5-mode 
measurements could open up a window, as direct as likely 
ever possible, onto the physics of the very early Universe, 
giving us unique insights on the physical laws governing 
at the highest energies. Such outstanding, anticipated 
consequences seem to be however matched by difficulties, 
which need to be overcome, first, to deliver an incontro- 
vertible, reliable detection and sufficiently precise char- 
acterization of the primordial B-vaode signal, and later 
to interpret it. The obstacles are of fundamental and 
instrumental origins and stem from the fact that the an- 
ticipated B-mode amplitudes are expected to be nearly 
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2 orders of magnitude below those of CMB E-mode po- 
larization and up to 10 times lower than the 5-mode sig- 
nal generated by the Galactic foregrounds. To meet and 
successfully address such a challenge progressively more 
sophisticated and advanced observatories have to be de- 
vised and built. Their complexity results in a number 
of design choices and decisions instrumental teams have 
to make in the course of their development. As those 
have often a direct impact on the science output of the 
instrument, these are science goals which should drive 
the decision-making process. Though such a situation is 
not new, the sheer size, complexity, and precision of the 
modern instruments and data sets call for novel, more 
robust ways of addressing the instrumental optimization 
problem. 

In this paper we propose a general, methodological 
framework for the experiment optimization and then ap- 
ply it in specific cases of CMB 5-mode observatories. We 
note that, however sophisticated an adopted optimization 
procedure may be, it is likely to always come up short in 
doing justice to all the complexity of an instrument under 
consideration. The goal of such a procedure, as we pursue 
here, is therefore not just to find a single best (in some 
sense) instrumental configuration. Rather, the goal is to 
provide, on the one hand, a reference against which to 
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judge actual hardware designs and, on the other, guide- 
hnes of, first, how to propose, given some science goals, a 
suitable and viable experimental design and, later, how 
to modify it to implement inevitable, real-life limitations 
and constraints in a way which will have a minimal im- 
pact on its scientific performance. 

Though the discussed formalism lends itself straight- 
forwardly to a number of generalizations, in this paper 
we demonstrate it in the context of the B-mode detec- 
tion by multifrequency observatories taking into account 
the presence of the astrophysical (diffuse) foregrounds, 
leaving a study of some of the most common instrumen- 
tal effects to a future work. We note that even in this 
limited context a result of the instrument optimization 
problem will depend on a number of factors: scientific 
goals as set for the experiment in question; models of 
the physical effects, e.g., foregrounds; specific techniques 
and assumptions they require, selected to be used for 
the component separations step. This emphasizes the 
need for using the state-of-the-art physical models of the 
foregrounds and the separation techniques in this kind of 
problem, as well as for continuing effort aiming at better, 
more reliable understanding of the foreground physics. 

As the optimization requires a capability to predict 
the performance of an instrument given its characteris- 
tics, it is very closely connected with performance fore- 
casting. In fact, in most of the similar work to date, 
the problem of selecting the most suitable experimen- 
tal configurations is typically treated as a performance 
forecasting problem applied to some predefined, and lim- 
ited, set of potential candidate experimental setups, the 
relative merits of which are subsequently evaluated and 
compared, e.g., [6-11]. This is in contrast with this pa- 
per, which employs an actual optimization procedure. 
In this respect our approach is most similar to the one 
by Amblard et al. [12]. Here we generalize and extend 
the latter work on both methodological and implemen- 
tation levels. We consider broader parameter space and 
optimization strategies, search for families of acceptable 
configurations, and by adopting the parametric compo- 
nent separation approach as the component separation 
technique of the choice, we manage to propagate real- 
istic ensemble-averaged errors to our selected figure of 
merit indicators in a statistically sound manner. 

The paper is organized as follows. In the next sec- 
tion we first describe a general framework of our ap- 
proach and then specialize it to our specific science case 
of CMB B-mode observations. In that section we show 
how the parametric component separation technique can 
be used to assess the performance of CMB experiments 
in the presence of galactic foregrounds, developing the 
approach to a performance forecasting in such cases. In 
Sec. Ill we detail the foreground model we use in this 
work. Section IV describes applications of the proposed 
formalism to two fiducial satellite experiments, based on 
the CMBpol [4] and COrE [3] proposals. In Sec. V we 
present our conclusions. Some of the lengthy calculations 
are collected in Appendixes A and B. 



II. METHOD 

Our approach is as follows. We start off from express- 
ing our science goals in terms of acceptable ranges of val- 
ues of some proposed figures of merit (Sec. II B), which 
are chosen to reflect the physical context of the consid- 
ered experiment. We then first treat all figures of merit 
(FOMs) separately and for each of them perform a strict 
optimization procedure (Sec. II C), i.e., minimize or max- 
imize it over a set of considered instrumental parame- 
ters. This is usually done in the presence of some exter- 
nal constraints arising for instance due to some hardware 
requirements but also some other science- driven restric- 
tions, (Sec. II C). This first step aims at determining 
the best possible instrument performance from the per- 
spective of the considered FOMs and their corresponding 
configurations. If for any of the FOMs the best perfor- 
mance value does not fulfill our science goals, the proce- 
dure halts and either the set of instrumental parameters 
have to be enlarged or the science goals/FOMs rethought. 
Otherwise, for each FOM, but one, we select a thresh- 
old value, which need to be attained by any acceptable 
configuration and perform the optimization of the one 
left-over FOM over the parameter space under additional 
constraints, requiring that all or some of the remaining 
FOMs are not worse than their established thresholds. 
If the optimization fails, we may need to adjust some 
of the thresholds and repeat the procedure again. This 
may be also the case if the solution found does not en- 
sure an acceptable value for the FOM, which is used in 
the optimization. If the tuning of the thresholds suc- 
ceeds, the solution obtained via the above procedure is 
used as a starting point for further post-processing and 
the corresponding set of values of all FOMs used as a ref- 
erence to compare any other configuration against. The 
post-optimization processing is used to implement some 
additional constraints and/or simplifications, which for 
some reason could not have been imposed on the formal 
optimization procedure. 

Below we present a specific implementation of this gen- 
eral framework in the context of primordial CMB B- 
mode observations by multifrequency multidetector ob- 
servatories in the presence of Galactic foregrounds. In 
this case our FOMs need to account for some effects aris- 
ing due to the component separation procedure, which 
has to be applied to data to recover a genuine CMB sig- 
nal. We therefore start below by discussing a specific 
component separation approach, the so-called paramet- 
ric maximum likelihood technique, and its impact on a 
CMB B-mode detection. 



A. Effects of foreground separation 

An estimation of the presence of the foregrounds in- 
volves two main steps. On the first step, we estimate the 
error incurred while constraining the spectral parame- 
ter values from the data. On the second, we translate 
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that error into some figures of merit expressing the over- 
ah quaUty of the separation process and which are then 
used in our optimization procedure. 



Formalism 



To estimate the spectral parameters we will use a pseudo 
(or profile) likelihood [13] given as, 

-2ln C=-Y^ {AN-^dpY (A^N-^A)"^ AN-Mp.(3) 



Hereafter we use the parametric maximum likelihood 
component separation approach implemented as in [13]. 
We thus assume a linear data model, where a signal mea- 
sured in each pixel p is given by 



A Sp + Yip , 



(1) 



where for each pixel p. 



• dp is a multifrequency data vector with each entry 
corresponding to a different frequency channel; 

• Sp is a multicomponent sky signal vector each entry 
of which corresponds to a different sky component 
and which is to be estimated from the data; 

• A is a mixing matrix defining how the components 
need to be combined to give a signal for each of the 
considered frequency channels; and 

• Hp is a vector containing the instrumental noise and 
assumed to be Gaussian and uncorrelated with a 
dispersion given by N. 

Here both A and N are assumed to be pixel independent 
for simplicity, with an exception of Sec. IV D 7. 

In the parametric approach, one assumes that A is 
parametrized by a set of spectral parameters, /3, which 
need to be determined together with the sky signal esti- 
mates. The noise level per channel, number of frequency 
channels, etc., are all dependent on instrument proper- 
ties, which thus will affect the results of the component 
separation process and could therefore serve as optimiza- 
tion parameters. Some other effects such as calibration, 
beam sizes, and bandwidths are also typically relevant 
and may need to be included in the modeling. We leave 
a thorough evaluation of those effects for future work, 
while neglecting them in this paper. 

Given values of P and defined instrumental parameters 
we can estimate the component signal using a standard 
maximum likelihood solutions, 



We will refer to this likelihood as the spectral likelihood 
and will identify its peak value with the best estimate 
of the spectral indices and the curvature matrix at its 
peak as the measure of the uncertainties expected for 
the spectral parameter estimation. These will be used 
to construct our figures of merit. 



2. Spectral parameter uncertainty 

The profile likelihood derivatives with respect to the 
spectral parameters can be readily computed and the rel- 
evant formulas are collected in Appendix A. As our pur- 
pose is to gain some insight in the constraining power of 
different plausible experimental setups rather than ana- 
lyze any specific data set we will average over the possible 
noise realization assuming that the noise correlation ma- 
trix, N, is known. Using Eq. (Al) from the Appendix we 
then arrive at 



d\nC 

dp 



{A^^SpY (Asp - Asp) (4) 



(A^N-^ A) ^ AN-^, 



(2) 



for the first derivative. In this equation, as well as ev- 
erywhere hereafter, we will use a hat over a quantity to 
mark that we refer to its true, rather than just an esti- 
mated, value, s is a sky signal estimate in the case of 
the noiseless data and it is defined in Eq. (A7). If the 
data model in Eq. (1) is correct both in terms of assumed 
scaling laws but also a number of components, the first 
derivative in Eq. (4) vanishes for the true values of the 
parameters, /3 = /5, emphasizing that the estimator is on 
average unbiased. Indeed in such a case we have A = A 
and s = s. Under the same assumptions the second order 
derivatives taken at the true values of the parameters can 
be then written as [see Eq. (A9)] 



dp dP' 



= tr 



|[a^^N-i A (A^N-^A) ' A^N-^A,^, - A^^N-^ A,^,] ^ SpS^j 



(5) 



Hereafter we will use the inverse of this matrix to ap- 
proximate the error matrix, X), for the recovered scaling 



parameters, i.e.. 



d"^ ln£ 

WW 



(6) 
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and 2-a contours using gaussian approximation 
1- and 2-a contours for gridded likelihood 




FIG. 1: 1-cr and 2-a contours in the /3dust - Psync space of the 
spectral hkeUhood, £, calculated using Eq. (3) for a random 
realization of the CMB and noise contributions, shaded areas, 
and compared against the Gaussian approximation with a dis- 
persion as given by Eq. (5), solid lines. The former likelihood 
has been recentered at the true values of the parameters. 



In Fig. 1 we show examples of the contours likelihoods, 
Eq. (3), computed for dust and synchrotron spectral in- 
dices for simulated data as described in Sec. Ill and for 
some fiducial nearly full sky experiment. They are com- 
pared with a Gaussian approximation based on the vari- 
ance derived with help of the error matrix, Eq. (6). Gen- 
erally we find a very good agreement. This may break- 
down somewhat in cases with very few pixels when the 
actual spectral likelihoods typically become somewhat 
skewed [13]. Nevertheless, we find that even in those 
cases though the Gaussian approximation may fail to re- 
produce properly the tails of the distributions, its overall 
performance is still rather good. In applications of inter- 
est for this paper a sufficient number of pixels is always 
granted. 

An interesting question is then how the precision of the 
spectral parameter estimation depends on the matrix F. 
The short answer is that given the noise levels the higher 
density contrast of the components, i.e., larger diagonal 
elements of F, the better precision of estimated /3, while 
large cross-correlation terms tend to increase the error. 



3. Residuals 



We note that the spatial morphology of the sky com- 
ponents enter the calculation of the errors only in a form 
of pixel averaged correlations, 

F-]^EM*- (7) 

Moreover, only those of the columns and rows of this 
correlation matrix matter, which correspond to sky com- 
ponents characterized by the scaling laws including some 
unknown parameters. Mathematically, this just follows 
from the fact that only columns of the derivatives of the 
mixing matrix, corresponding to such components 

do not vanish. Physically, this indicates that the com- 
ponents for which the scaling laws are known unambigu- 
ously, e.g., CMB, are subtracted cleanly during the sep- 
aration process and do not affect the result of spectral 
indices estimations. An immediate consequence of this 
is that the resulting expressions are indeed equivalent to 
those obtained while averaging over an ensemble of real- 
ization of noise and CMB signal. 

We note that though our conclusion about the impact 
of different components on the spectral parameter esti- 
mation is general, a simple form of the dependence of the 
latter on the foreground signal morphology is due to our 
simplifying assumption of a pixel-independent noise level. 
In general, the relation is more complex, with noise lev- 
els selective (de)emphasizing the contributions of some of 
the pixels on the sky. Though the formalism developed 
here is general and can be straightforwardly adopted to 
a case of arbitrary and correlated noise it can quickly be- 
come computationally heavy. We will therefore leave a 
discussion to more complex and realistic cases for future 
work. 



From the discussion in the previous section it is clear 
that the precision of the spectral parameters determina- 
tion though relevant is clearly not a single factor impor- 
tant in quantifying the component separation effects on 
the 5-mode science. This is due to the fact that better 
precision is usually related to a higher foreground con- 
trast and vice versa making it not straightforward to infer 
an effective foreground contribution left over in the CMB 
map after the separation process, given just the spectral 
indices errors. However, given the estimated value of the 
spectral parameters, /3, we can always calculate the level 
of the foreground residuals, i.e., a mismatch between the 
estimated and true sky components. This can be ex- 
pressed as follows [10], 

A = s - s = (Z - I) s, (8) 

where 

Z(/?) = (A*(/?) N-i A (/?))-' A*(/?) N-iA(^), (9) 

I is a unit matrix and, as usual a hat over a quantity 
denotes its true underlying value. 

The foreground residuals left in the CMB map are just 
one component of the vector. A, which for definiteness is 
assumed to be the zeroth one. We will now restrict our- 
selves to the CMB component and linearize the problem, 
assuming that the errors in spectral parameter determi- 
nation are small. We thus obtain 

where 

4 ^ (11) 
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and we assumed that the CMB component is stored as 
first (i.e., with an index equal to 0) in the component vec- 
tor, s. We can now characterize the level of the residuals 
either simply by its rms value or, in a more informative 
way we can estimate the noise average (though noiseless) 
foreground residual power spectrum, which reads 

Cf ^^^S,,,a°^a°/Cf . (12) 

Given that as mentioned before (see also, [10]) no CMB 
signal is left in the CMB map residuals, which combine 
just the foreground signals, the noise ensemble averages 
coincide with those made over a full CMB + noise set 
of realizations. Clearly to compute the residual spectra 
we need to make assumptions concerning the spatial mor- 
phology of the considered foregrounds, i.e., the knowledge 
beyond the F matrix defined earlier. This is reflected in 
Eq. (12) by the presence of true auto- and cross- spectra 
for each considered foregrounds, &^ . However, the F 
matrix provides a sufficient description necessary to cal- 
culate the rms value of the residuals. This can be seen 
noting that 

A?r' = E E ^^^'O^o^'Fjr- (13) 

In the following we will use the Cf" quantity to construct 
our FOMs making some specific assumptions about the 
foregrounds spatial properties as described in Sec. III. 
We point out that the formulas presented above are just 
a special case of those already studied in [10]. The im- 
portant difference is however that the spectral indices 
uncertainties used in this work are computed effectively 
as the full CMB + noise, ensemble averages rather than 
derived in a single, particular study case as in that pre- 
vious work. 



B. Figures of merit 

Given the estimates of the foreground residuals pro- 
vided in the previous Section, we can now define our 



figures of merit. Hereafter, we will use three FOMs: two 
referring to the effects of the foreground residuals found 
in the recovered CMB map as a consequence of the sep- 
aration process, and the third related to the noise level 
of that map. As our scientific goals here are related to 
the primordial B-mode signal two of the proposed FOMs 
express the effects of the foreground residuals on a tensor- 
to-scalar ratio (of the respective CMB spectra), r. The 
third one is more generic and is just to ensure that the 
least-noisy map of the sky is produced. 

a. FOM#l.' Tstat - an r value detectable on 95% 
confidence level incorporating the component separation 
uncertainties. 

This FOM is computed in two steps. First, we use a 
generalized Fisher matrix expression to estimate the un- 
certainty of estimating the tensor-to-scalar ratio, r, for 
any given assumed r value, and subsequently we deter- 
mine a value of r = rstat^ which is detectable on 95% 
confidence level. This limiting value is defined as 



rstat^2F-^/^rstat)- (14) 



The Fisher matrix we propose to use here accounts 
for usual cosmic, sampling, and noise variance, but also 
for an extra error resulting from the shortcomings of the 
foreground component separation, which is presumed to 
be applied to the maps beforehand. We model the sep- 
aration residuals following the formalism introduced in 
Sec. II A 3 and which treats the map-level residuals as 
a linear combination of the foreground templates with 
Gaussian distributed amplitudes. 

The detailed derivation of the Fisher formula is pre- 
sented in Appendix B. Recalling that denotes the 
power spectrum of the residuals, the final expression for 
the Fisher matrix, F^^, reads then 



{2i + 1) 6u^ 

2 fsky 



1 + E (2^" + 1) 



{21 + 1) (2f + 1) CP 



2CjCl 1 + E (2^" + 1) 



dr 



(15) 



where for shortness we set = C^^^ -h C^^*^^. affect both the noise level as quantified by C^^^^^ but 

also the level of residuals resulting in different rstat values 

A choice of experimental parameters will in general 
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derived for different proposal configurations. 

We note that if the level of residuals is very high 
as a result of the errors on spectral parameters be- 
ing large then the first order expansion used to obtain 
Eqs. (10) and (12) may not be any more sufficient. Like- 
wise, if the foreground contributions are large so their 
residuals are comparable to the CMB signal, sufficiently 
precise knowledge of the foregrounds would become nec- 
essary to ensure that the above formulas produce reliable 
results. As one may not be completely comfortable with 
such a presumption, we will introduce another FOM de- 
signed to penalize such configurations. 

b. FOM#2; rg// - an effective r value of the fore- 
ground residuals. 

We use a proposal of [12] and we characterize any ob- 
tained foreground residuals using its effective value of r 
defined as 

s{reff) 0^ u, (16) 

where 

s(r) ^ Y.cr\r)-cr\Q), 

i 

£-max 

i 

We note that due to a missing factor of 2-^ + 1 this crite- 
rion does not compare power contained in the primordial 
B spectrum with that of the residuals (up to ^max)^ and 
in contrast to the latter it gives more weight to low mul- 
tipoles. 

c. FOM#3; crg^g - noise level of the recovered 
CMB map. 

When the true values of the spectral parameters are avail- 
able the only uncertainty of the recovered component 
maps, Eq. (2), is due to the instrumental noise and reads 

M = (A^N-^A)"', (17) 

and therefore depends on the number of detectors and 
frequency channels. With our focus on the CMB we will 
therefore use the diagonal element of J\f corresponding 
to the CMB component as one of our criteria, which we 
would like to keep as low as only possible. We thus have 

(aS^^ff)'^A/oo. (18) 

We note that only when A is a unit matrix the 
above formulas corresponds to a standard, inverse- noise- 
coaddition. This in turn can only happen if no sky com- 
ponents are mixed together, implying no foregrounds. In 
any other case the final noise of the CMB map is higher 
than the inverse noise weighting would imply [14] and its 
exact value will depend on the details of the component 
scalings and experimental set up. We note that unlike 
two other FOMs implemented here this applies on a map 
rather than a power spectrum level. Moreover, as the 



spectral parameters, /3, are assumed to be known ahead 
of the computation, this FOM may lead to configura- 
tions in which the estimation of those is not feasible and 
thus rendering the residuals effectively arbitrary and un- 
known. Nevertheless, though it needs to be used with a 
care, it provides a meaningful reference against which to 
gauge other configurations. 



C. Optimization procedure 

1. Parameters and optimization approaches 

In this work typically we will optimize a number of 
detectors in each of the pre-defined frequency channels. 
This is clearly one of the most basic hardware parameters 
one would like to know designing a B-mode experiment. 
Though the central frequency of the channels is often 
constrained from the onset by some hardware constraints, 
we will also consider more general optimization problems 
in which a number of frequency channels, their central 
frequencies, and a number of detectors per channel are 
all to be optimized with respect to. 

In the former case we perform a single global optimiza- 
tion operation. Our numerical codes use a minimization 
algorithm for constrained nonlinear multivariate func- 
tion, as implemented in matlab, which is based on a 
line-search algorithm with constraints introduced via a 
quadratic approximation to the Lagrangian function. 

In the second type of the optimization problems we 
have found that attempts of performing a global opti- 
mization are often frustrated by numerical issues and 
the results are consequently not very reliable. Instead 
we have devised a multi-step approach which is shown 
schematically in Fig. 2. In the proposed method we start 
from a configuration consisting of a focal plane overpop- 
ulated with a large number of mock channels uniformly 
covering the requested interval of frequencies. Each of 
these channels is assigned the same number of detectors 
or a fraction of the focal plane area, depending on which 
hardware constraint we use (step 1). We then optimize 
the number of detectors or focal plane area as in the stan- 
dard case with the fixed frequency channels with respect 
to a given FOM (step 2). As the obtained detector distri- 
bution is typically rather inhomogeneous we then merge 
the channels with close central frequencies, e.g., closer 
than the expected band-width of the anticipated chan- 
nels. In the process of merging we replace some subsets 
of channels by a new channel, centered at the bary center 
of the previous frequencies as weighted either by a num- 
ber of detectors or focal plane assigned to each of the 
merged channels, and assign to it either their detectors or 
the corresponding focal plane area (step 3) . We optimize 
this new configuration again with respect to numbers of 
detectors per channel, and go back to step 2 whenever 
the resulting configuration is found very inhomogeneous. 
Then we repeat this process again. We find however that 
usually a single pass over the optimization sequence pro- 
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FIG. 2: Schematic illustration of our optimization procedure 
in a case of an adjustable number of channels, a number of 
detectors per channel, and their central frequencies. 

duces satisfactory results. 

2. Constraints 

The constraints can be imposed straightforwardly via 
Lagrangian multipliers therefore permitting a wide vari- 
ety of those, which can, and sometimes have to, be intro- 
duced. 

These include some trivial constraints stemming from 
the physical interpretation of the optimized parameters, 
e.g., ensuring non- negative values for detector numbers 
or focal plane area, which have to be usually included 
explicitly. 

There are also some fundamental constraints without 
each the convergence could not be reached at all. These 
typically followed from the hardware restrictions. As an 
example of the hardware constraint, hereafter we will use 
either a constraint on a total area of the focal plane or on 
a total number of detectors, corresponding to cases where 
we have full freedom to fill in the entire focal plane as 
densely as only needed or when such freedom is restricted, 
for instance, by capability of our read-out systems. 

Yet another type of constraints invoked in the opti- 
mizations studied here includes those driven by the sci- 
ence goals rather than hardware requirements. For in- 
stance, we could require that some specific frequency 
channel map has a noise level better than some pre-set 
level in order to make such a map good enough to investi- 
gate some sky objects or features of interest. These kinds 
of constraints are often needed in the post-processing 
phase described later. 

In addition, while considering multiple FOMs simulta- 
neously we will typically use some of them as constraints 



restricting the optimization to such configurations for 
which the required values of these FOMs is better than 
some suitable threshold. 



D. Post-optimization processing 

The optimized solution formally determined as de- 
scribed here in most of the cases will require further ad- 
justments and tuning, before it could become a basis for 
an actual instrument design and later its potential devel- 
opment. 

Specific instances of such post-optimization processing, 
which we consider hereafter include: 

• design simplification - including either rounding 
of numbers of detector per channels and/or remov- 
ing some channels altogether, in particular those 
assigned a small number of detectors. 

• addition of some ad hoc frequency channels 

- for instance, either to improve the overall robust- 
ness of the derived configuration with respect to 
potential surprises concerning physical properties 
of the foregrounds, or to extend the science goals 
beyond what is already encoded in the FOMs. 

In all these cases a crucial question is how significant 
modifications from the initial optimized setup are allowed 
before the science goals, as expressed by the FOMs, are 
compromised too significantly to be acceptable. Below 
we outline a general approach devised to answer such 
questions in some specific cases relevant to the applica- 
tions considered here, leaving a more detailed descrip- 
tion of its practical implementation in our study cases to 
Sec. IV. 



1. Detector number rounding 

Let us consider only channels for which the optimiza- 
tion procedure has assigned a nonzero number of detec- 
tors. Moreover we start from the channels for which we 
want to decrease a number of detectors, as a result of the 
rounding procedure, and postpone the treatment of the 
remaining ones for later. For the time being we also relax 
all the constraints imposed on the optimization, with an 
exception of the ones ensuring positivity of a number of 
detectors or focal plane area. Removing some of the de- 
tectors decreases the instrument sensitivity and thus will 
affect our science goals, unavoidable rendering the exper- 
iment less competitive. For any specific configuration we 
can always calculate exactly its performance in terms of 
the adopted FOMs. However, on the experiment design- 
ing stage, when many such configurations may need to 
be considered and often quickly discarded, the need for 
the case-by-case computation may be a hinderance. In 
such a context a fast, even if rough and approximate, ap- 
proach could be therefore a handy substitute permitting 
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one, on the one hand, to zoom quickly on an interesting 
family of potential solutions, and, on the other, to reject 
configurations which are clearly of no interest. One way 
to address such a need could be to construct, for each 
FOM, a series of hyper- volumes, V/c, (/c = 0, ny — 1), 
centered on the optimized configuration and such that 
Vo C Vi C ••• C Vnv-i- To each volume, V/c, we can 
assign uniquely a value, Vk-, such as, 

Vk= min {YOMi{di})\, (19) 

i.e., which defines the worst performance plausible within 
the volume. The values Vk are directly arranged in a de- 
scending order given that any volume contains all the 
previous ones. If now a configuration of our interest be- 
longs to the k-th. volume and does not to the {k — 1)- 
th one we immediately can infer that its performance, 

expressed in terms of the given FOM, is bracketed by 
the two values corresponding to these two hyper- volumes, 
i.e., Vk-i <v<Vk- 

Two features are essential to make such a scheme use- 
ful. First, we have to have an easy way to identify 
whether a given configuration is or is not contained in 
a given hypervolume. Second, the volumes have to be 
defined in such a way that the values of Vk assigned to 
them span a range of interesting values and do so suf- 
ficiently densely. Given potential high- dimensionality of 
the parameter space we consider here, none of these two 
requirements is straightforward to satisfy. To address the 
first of them we propose to use as the volumes hyperel- 
lipsoids defined as 

v.^iwiE ^'^^'dir' ^i'^^ (20) 

where the last condition on the right hand side narrows 
the volume to the cases of our interest here. The semi- 

(k) 

axes of the ellipsoid, cr) , need to reflect the fact that the 
rate at which the given FOM changes will be in general 
different in different directions in the parameter space. 
We therefore determine them for every direction corre- 
sponding to varying detector numbers in a single chan- 
nel separately and we do it for each channel of relevance 
here, i.e., for which d^^^ ^ 0. The procedure here involves 
two steps. First, we select a grid of values of the consid- 
ered FOM, Vk, which covers the range of its values of 
our interest and does that with a sufficient density. This 
grid is used consistently for all directions and channels. 
Subsequently, for every channel, i, we find numerically 
a dependence between a value of FOM and a distance 
from the optimized solution along i-th axis of the pa- 
rameter space and use this relation to determine a^^'' so 

Typically, the grid point values, Vk, 
will provide a good approximation to the worst case val- 
ues, Vk: defined earlier. The latter are therefore expected 
to be automatically well-spaced and to span a sufficient 
interval of FOM values. In actual applications, we com- 
pute more precise estimates of Vk than those provided by 



Vk. This is done by using Eq. (19) and randomly sam- 
pling the volume of the corresponding hyperellipsoid. 

The proposed construction therefore obeys the two re- 
quirements we defined earlier and provides a quick and 
easy way to find out how far the configuration can be 
tweaked, without compromising the science goals. The 
parameters a-^^ and Vk constitute an additional and im- 
portant piece of information, which should be determined 
and provided alongside any optimized configuration to 
render the optimization process helpful. We demonstrate 
this in actual applications in Sec. IV D 5. 

So far we have neglected the hardware constraints. 
Those would require that any subtraction of the detec- 
tors from some of the channels needs to be accompanied 
by adding detectors somewhere else. However, as adding 
detectors can only improve our FOMs, the procedure out- 
lined above is conservative as the final outcome of the 
rounding with the constraints fulfilled can be only better 
than what the procedure implies. 

We can now get back to the channels for which we 
might have wanted to round up the number of channels. 
This can be done but only by appropriately distribut- 
ing the detectors we have removed earlier, as the overall 
hardware constraint has to be fulfilled. If we do not have 
however strong preferences regarding their distribution 
we may try to perform a second round of the optimiza- 
tion to find out how it can be done in an optimized way. 
This could be done by solving the optimization prob- 
lem as the initial one but adding extra constraints fixing 
the number of detectors to their rounded value in all the 
channels, where the rounding has been applied. 



2. Low-populated channels 

The formal optimization procedure proposed here may 
result in configurations, which include a number of chan- 
nels with a relatively low number of detectors. As extra 
frequency channels contribute to an overall complexity of 
the instrument, it could be advantageous to remove those 
if there is no strong science driver behind them. Remov- 
ing entire channels is more delicate than a removal of 
some fraction of the detectors as discussed above. This 
is because it can render the separation process singular 
or nearly so with separation errors growing rapidly. The 
singularities however can be usually avoided by keeping 
track of a number of channels needed to separate some 
specific number of components, each described by a well- 
defined number of parameters. We will therefore assume 
throughout that this is indeed the case. We then pro- 
ceed as follows with the underpopulated channels. We 
remove such a channel or contiguous group of those and 
either redistribute the extra detectors between the ad- 
jacent channels or create a new channel with a central 
frequency computed as a detector (or focal plane area) 
weighted average of the frequencies of the channels to 
be replaced. We then test the change in the FOM val- 
ues. If either of the options is not satisfactory, we can 
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try to further to improve on it by performing formal op- 
timization but now using only channels which contain a 
nonzero number of detectors. If that still turns out to be 
much worse than the optimized values of the FOM, we 
subsequently need to identify, which of the low-populated 
channels are crucial from the performance point of view 
and retain them in our final configuration, while remov- 
ing or merging the others. 

3. Ad hoc extra channels 



tion then is how robust the science goals posed for the 
experiment are assuming that the target has been de- 
fined using the procedure described here. We address 
this problem in a specific case in which we admit some 
failure rate for the detector production process, e. For a 
set of realistic values of e we perform a random sampling 
of the parameter space randomly drawing a number of 
failed detectors. We then evaluate the full set of FOMs 
for each of the samples and find what is an average, likely 
on 95% confidence level impact of the considered failure 
rates on the FOM values. 



Clearly our optimized configuration is only as good as 
the foreground model assumed in the optimization pro- 
cess. The impact of some of the uncertainties in the fore- 
ground modeling can be discussed directly within the for- 
malism presented here as, for example, that of details of 
the foreground correlation matrix and/or shape of their 
power spectra. It is more difficult however to investigate 
the role of our assumptions about a number of spectral 
parameters and/or a number of foreground components. 
In that respect one may feel more at ease with the con- 
figurations, which have the entire frequency range acces- 
sible to the instrument sufficiently populated, as they, at 
least on the intuitive level, may appear more robust with 
regard to the unknown. 

If the optimization does not lead to a configuration, 
which satisfies such a condition on its own, one may 
want to impose it by adding one or more ad hoc fre- 
quency channels in the areas they are missing. This can 
be done straightforwardly by adding a constraint requir- 
ing at least some predefined and nonzero number of de- 
tectors in those channels. If this number is fixed exactly, 
it will be obviously not anymore a parameter of the op- 
timization, however the channel will still take part in the 
optimization process as it will be taken into account in 
the FOM computation. We use this approach to answer 
an important question, i.e., how close such a new config- 
uration would perform as compared to the original, opti- 
mized one. In other words, should the foreground model 
used turn out to be correct, would we lose much by try- 
ing to make the configuration more robust ? Ideally, the 
loss of performance will not be significant, permitting us 
to reach both these goals simultaneously: near optimal- 
ity whenever our modeling is correct, and ability to meet 
the surprises. In Sec. IV D 5 we discuss how the parame- 
ters of such ad hoc channels can be proposed in a specific 
application. 



E. Design robustness 

A problem closely related to the one discussed above 
is that of the robustness of the final configuration. Given 
some unavoidable failure rates in a technological process 
involved in the instrument design and development, a 
final version of the instrument typically comes short of 
the actual design target. An important and valid ques- 



III. FOREGROUND MODELLING 

As discussed earlier in our formalism there are two 
key quantities needed to describe completely the effect 
of foregrounds. These are the auto- and cross- spectra 
characterizing the spatial distribution of the foreground 
components and the component correlation matrix, F. 
To calculate these we will rely on a specific model of the 
Galaxy and since we are interested in the B-modes, we 
will consider only diffuse foregrounds, synchrotron and 
dust, with known and non- negligible polarization emis- 
sion. 

To simulate these emissions in polarization we imple- 
ment the same recipe as in [10], which starts off from 
deriving reliable total intensity templates from the avail- 
able data (the Haslam map [15] for the synchrotron and 
the combined COBE-DIRBE and IRAS for the dust [16]), 
rescales them using some constant overall polarization ef- 
ficiency factor, fixed to 10% in order to match the large 
scale E and B spectra of [17], therefore producing po- 
larization intensity templates. The polarization angles 
on the largest scales are then determined using a combi- 
nation of the WMAP data and three-dimensional mod- 
eling of the Galactic magnetic field as in [17], while on 
the small angular scales ( ^ 1°), by randomly simulating 
those using their angular power spectra as derived from 
the data [18]. 

We assume spatially constant frequency scalings: a 
power law with index = ~3 for the synchrotron, i.e., 

^sync (Z^, ^ref) = i ) (21) 

and a uniform greybody scaling law, as in Model 3 of 
[19], 

f V \ ''""^^ exp ^"""^ - 1 

^dust(.,^.e/)=(— ) —1^' (22) 

\^ref J kTd ^ 

where Td = 18.0 K and /3d = 1.65 for the dust. 

As pointed out in [10], by adopting this model a large 
amount of correlation is expected between dust and syn- 
chrotron both because the Galactic magnetic field is a 
common ingredient and because of the lack of high res- 
olution data that forces us to extend the correlation to 
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FIG. 3: Three foreground masks as used in this work. Yel- 
low (largest mask), dark red (large mask round the galactic 
bulge), and dark blue (narrowest mask around the galactic 
plane) mark sky areas excluded from the masks: Mask II, 
P06, and Mask I, respectively. 



Mask 


fsky Fdust — dust F^ust — sync Fsync — sync 


P06 mask 
Mask I 
Mask II 


0.73 3.20 0.082 0.0025 
0.82 1.12 0.029 0.00084 
0.51 1.74 0.053 0.0019 



TABLE I: F matrix elements computed for two foreground 
components, dust and synchrotron, at the fiducial frequency 
of 70 GHz for the three masks used in this work and all pix- 
elized using healpix scheme with nside =128. 
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FIG. 4: Pseudopower spectra of the foreground templates 
for the three different masks considered in this work and 
contrasted with the CMB B-mode power spectrum. For 
each mask the three lines show dust (solid line), synchrotron 
(dashed line), and their cross-correlation (dotted line). The 
foreground signals are computed at the 65 GHz. All the spec- 
tra used in this work are computed from HEALPIX-pixelized 
maps with nside = 512. 



IV. APPLICATIONS 



small scales. This is reflected in the fact that the off- 
diagonal terms of F are of the same order of the diagonal 
terms. However, as we discuss in Sec. II A2 large off- 
diagonal terms inflate the errors on spectral parameters, 
so from the perspective of foreground residuals the em- 
ployed model can be considered conservative. 

To investigate the effects of different foreground con- 
trasts and morphology we consider here three different 
sky masks. Mask I and Mask II are tailored in such 
a way that they have the possible total polarized fore- 
ground contrast (synchrotron plus dust) lower than a pre- 
defined threshold equal to 0.86 and 0.36 /iK, respectively. 
We also employ more standard the P06 mask from the 
WMAP team, which is optimized for the low frequency 
coverage of WMAP, i.e. it is skewed toward cutting out 
more the synchrotron than the dust emission. All three 
masks are shown in Fig. 3 and their corresponding fore- 
ground (pseudo) power spectra are displayed in Fig. 4. 
In addition, in Table I we list the elements of the matrix 
F for each of them. 

These masks are thought to be applied a posteriori to 
the full sky map, assumed to be homogeneously observed 
by the experiments. This means that the noise level per 
pixel, described in Sec. IV B, will be the same for each of 
them and thus the results of the F0M#3 optimization 
will be the same in all three cases. 



As an illustration of the method detailed in the pre- 
vious sections, we will consider the optimization of two 
different full sky satellite designs: Cosmic Origins Ex- 
plorer (COrE) proposed in response to the European 
Space Agency Cosmic Vision 2015-2025 Call [3], and 
CMBpol [4, 20], proposed as part of the NASA mission 
concept study. The respective frequency channels and 
a number of detectors per channel corresponding to the 
original designs are summarized in Table II for CMBpol 
and in Table III for COrE. In our analysis we will assume 
the same noise levels per detector for each of the exper- 
iments, Sec. IV B, and that they scan the sky homoge- 
neously with all the detectors observing simultaneously 
over the course of 4 years. Everywhere in this paper, 
but in Sec. IV E, we will aim at optimizing a number 
of detectors per channel, assuming that the latter are 
fixed and known, and keep either the effective area of 
the focal plane or total number of detectors constant. 
The assumed values for the two constraints are derived 
given the proposed configurations of COrE (Table III) 
and CMBpol (Table II). In the case of the focal plane 
area we assume that an area of the focal plane occupied 
by a single, diffraction-limited detector operating at fre- 
quency, can be expressed as 

(23) 
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Frequency [GHz] 


30 


45 


70 


100 


150 


220 


340 


500 


850 


Number of detectors 


84 


364 


1332 


196 


3048 


1296 


744 


938 


1092 



TABLE II: CMBpol distribution of detectors among the different channels. 



Frequency [GHz] 


45 


75 


105 


135 


165 


195 


225 


255 


285 


315 


375 


435 


555 


675 


795 


Number of detectors 


64 


300 


400 


550 


750 


1150 


1800 


575 


375 


100 


64 


64 


64 


64 


64 



TABLE HI: COrE distribution of detectors among the different channels. 




FIG. 5: Breakdown of the focal plane area between the fre- 
quency channels as originally proposed for the COrE, left, 
and CMBpol, right, satellites. In the case of COrE all the 
channels with frequencies larger than 250 GHz represent less 
than 10% of the total focal plane area. 



The total focal plane area is then obtained by summing 
over the contribution coming from all the detectors. We 
note that this gives at the best some effective area be- 
cause we do not take into account any kind of filling fac- 
tor, which is usually driven by technical constraints such 
as the shape of the detectors, the wiring, etc. Fig. 5 
shows the fractional area as occupied by each channel in 
the case of the proposed versions. 

Hereafter we neglect the effects of the E-B leakage, 
e.g., [21], both in the calculations of the foreground spec- 
tra as well as the CMB variance. In the former case 
this is justified given the fact that E and B spectra for 
foregrounds are on comparable levels and the leakage is 
usually harmless. For the CMB variance we assume that 
the effects of such a leakage can be largely removed using 
one of the methods proposed in the literature. Though 
corrections of this sort usually lead to some extra preci- 
sion loss, this is typically only a fraction of the standard 
cosmic variance and, at least for experiments with a suf- 
ficiently large sky coverage, small enough not to change 
our results in a significant way. For small-scale observa- 
tions the effect may not be negligible and should be taken 
into account, e.g., [10, 21]. 

For some alternative analyses of performance of these 
two experiments see, e.g., [8, 9, 14]. 



A. Mixing matrix 

To define the mixing matrix, Eq. (1) relevant for the 
problem at hand, we will use the component frequency 
scaling laws as defined in Sec. III. We set the refer- 
ence frequency, i.e., frequency at which all the compo- 
nent maps are recovered as equal to 150 GHz. We also 
account for frequency band-shapes. For this we will as- 
sume that they are top-hat-like with a width equal to 1/3 
of the central value. Therefore, 
mixing matrix will be given as 



an element, Aij of the 



(24) 



where Ui is a frequency of the i-th channel, (y^Vref) 
is a photon flux as measured at frequency v relatively to 
Vrefi cind Wth{--, ctth) is a top hat window centered at 
and with a width aTH- As mentioned earlier we assume 
hereafter that the scaling laws adopted on this stage coin- 
cide with the true ones modulo the unknown parameters. 
Nonetheless we will limit the frequency range of the chan- 
nels included in our discussion below to between 30 and 
400 GHz, to, on the one hand, avoid channels where the 
CMB is completely swamped by the foregrounds and, on 
the other, not to stretch the adequacy of the frequency 
scaling model of the dust over a too broad interval. 



B. Noise levels 

We assume sky-noise limited detectors. Their noise 
level, in antenna units, is taken to be independent on 
a detectors operating frequency and set to be equal to 
at ^ 30 flK^/s [3]. A single detector noise level per 
pixel will then be given by an observation total length, 
Tobs and pixel area. The detector noise per channel will 
also depend on a number of detectors operating at a given 
frequency. The numbers of detectors for each channel, 
{di}(i=o n/-i)' parameters we will be most fre- 

quently trying to optimize in the reminder of this paper. 
The noise correlation matrix will be then assumed to be 
diagonal and the diagonal elements will be given by 



^obs di 



(25) 



Here, A/"*^* is a total number of observed pixels (to be dis- 
tinguished from Npix a number of pixels included in the 
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analysis {Npi^ will depend on the mask we will consider; 
see Sec. III). 



C. Resolution 

So far we have ignored completely the fact that detec- 
tors operating at different frequencies will likely have a 
different resolution, in particular if they are diffraction- 
limited. Because the parametric maximum likelihood 
component separation approach adopted here is pixel- 
based all the channel maps will have to be however 
smoothed to some common resolution before the separa- 
tion can be accomplished. The extra smoothing required 
here is not generally lossless and may introduce noise 
correlation between the pixels. Hereafter we will ignore 
such effects and keep using Eq. (25) to compute the noise 
levels with only the pixel size, and thus a number of pix- 
els, adjusted accordingly. As far as the sky signals are 
concerned, given that our science goals are mostly con- 
strained by the large angular scales, we will mimic the 
common resolution by setting a hard limit on the consid- 
ered value of £ to be irnax = 500, as we have found that for 
the considered noise levels there is no information beyond 
that range. We note that in a more refined approach one 
may want to introduce the resolution as an optimization 
parameter and constraint it by requiring that the gain 
due to its decrease is larger than some threshold. All 
the power spectra used in this work have been derived 
using HEALPIX pixelized maps with the healpix resolu- 
tion level, nside = 512. This is clearly sufficient given 
the hard ^-space cut off we have adopted here. We stress 
that this resolution is higher than the one used in Sec. Ill 
for the determination of the matrix F. This is because in 
the latter calculation only pixel-domain quantities are in- 
volved, which are overwhelmingly dominated by the large 
scale fluctuations for which nside =128 maps are entirely 
sufficient. 



D. Fixed number of channels with pre-defined, 
fixed frequencies 

In this Section, we describe the optimization of the 
two experiments, assuming that the frequency channels 
are fixed ahead of the procedure. The results are sum- 
marized in Tables IV and V for COrE and CMBpol, re- 
spectively, and for each FOM (called there for shortness 
as Fl, F2 or F3), three considered sky masks (P06, Mask 
I or Mask II), and two hardware constraints (total area 
or total number of detectors), and are contrasted with 
results obtained for the original designs of the experi- 
ments, as shown in the rightmost columns of the Ta- 
bles. We note that though the latter configurations are 
mask-independent, the corresponding FOMs values dif- 
fer somewhat from mask to mask due to differences of 
the sky included in the analysis. For each of the opti- 
mized configurations the tables show a corresponding to- 



tal number of detectors, focal plane area, effective noise 
levels, spectral index determination precision, and values 
of the three FOMs. A selection of these results is also de- 
picted in Figs. 6-9, showing, as bars, a number of detec- 
tors for each of the considered channels, left panels, and 
power spectra of the residuals corresponding to each con- 
figuration, right panels. The visualized cases are those 
based on the P06 mask, however the other cases would 
look similar. In each Figure the upper left panel shows 
a corresponding original configuration followed by three 
panels displaying configurations optimized with respect 
to each of the three FOMs. Four general observations are 
in order here. 

1. The optimized configurations depend on the FOM 
used for the optimization. 

2. The constraints imposed on the problem affect the 
results. Constraining the focal plane area gives 
preference to the high frequency channels with de- 
tectors occupying a small area and thus leads to 
a worse determination of the synchrotron signal, 
which in turn leads to a higher level of residu- 
als, if these are left unconstrained, i.e. in cases 
of F0M#1 and F0M#3. Also the overaU noise, 
F0M#3, tends to be higher. 

3. The final configurations obtained for each of the 
three masks are essentially identical, though the 
actual values of FOMs do differ mostly due to a 
different number of pixels with Mask II containing 
the fewest of those. 

4. The optimized configuration contain significantly 
fewer frequency channels than allowed for in the 
optimization and therefore fewer than proposed in 
the original versions of the both these experiments. 

Below we comment on some of the result in more de- 
tail and leaving a general discussion for the conclusions. 
Sec. V. 

1. F0M#1 optimization - Vmin 

For all configurations shown in Tables IV and V for 
which F0M#1 could be computed, i.e., those containing 
more than just 3 channels, r^nin is found to be on the 
order of 10 ~^ and varying from case to case by no more 
than a factor of 2. This is also the case for the original 
designs of the COrE and CMBpol satellites. The values 
of F0M#1 optimized under the constraint of the total 
number of detectors tend to be somewhat better (worse) 
than those derived under the total focal plane area con- 
straint for COrE (CMBpol). The differences are however 
small across the board and probably irrelevant in prac- 
tice. 

In both the COrE and CMBpol cases, the optimization 
of F0M#1 leads to configurations for which also F0M#3 
is close to the optimum, as the latter is found to be within 
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FIG. 6: Left: Optimized distributions of numbers of detectors per channel derived under the total focal plane area constraint 
for the COrE satellite, including only channels below 400 GHz. From top to bottom we show first the original distribution 
followed by the three optimized ones derived using F0M#1 to #3, respectively. Right: Corresponding power spectra of the 
residuals and the noise computed for the optimized configurations shown on the left and compared against the spectrum of the 
CMB 5-modes with r = 0.001. 




FIG. 7: As in Fig. 6 but imposing the constraint on the total number of detectors. 



5-10% of its best value for the respective hardware con- 
straints. This suggests that this is the variance due to 
the noise rather than the foreground residual, which con- 
tributes to the recovered value of the F0M#1 more sig- 
nificantly (see also [10]). Conversely, as a consequence 
in such cases the level of the foreground residuals is not 



tightly controlled and therefore the FOM#l-optimized 
configurations result in values of F0M#2, which are at 
least 1 order of magnitude above the best achievable Tg//, 
and worse than the values derived for the proposed de- 
signs. As we normally would prefer to avoid too high 
residuals we conclude that F0M#1 is not sufficient as a 
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FIG. 8: As in Fig. 6 but for the CMBpol satellite. 
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FIG. 9: As in Fig. 7 but for the CMBpol satellite. 



stand-alone optimization criterion and preferably should 
be combined with some other indicator, efficient in en- 
forcing the low value of the residuals. We will get back 
to this issue later on in this Section. 



2. F0M#2 optimization - Veff 



From Eqs. (10)-(12) it follows that a good determina- 
tion of the spectral parameters Pdust and fSsync is nec- 
essary and sufficient to ensure a low level of the fore- 
ground residuals. We therefore expect (see also [12]) 



that in the FOM#2-optimized configuration the detec- 
tors should populate predominantly low frequency bands, 
which are dominated by the synchrotron signal, the CMB 
band, and high frequency bands, dominated by the dust. 
As we require at least 4 channels in the case at hand 
to avoid problem singularity and impose the hardware 
constraint the actual answer is somewhat more complex, 
nevertheless the overall detector distribution conforms 
with the above intuition. Indeed the FOM#2-optimized 
configurations include channels below 50 GHz, around 
100 - 130 GHz, and above 250 GHz. This applies for 
both the experiments and for every mask. The details of 
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channels 






P06 mask 








mask I 






mask II 




proposed vers 


ion 


Constraint 






area 






total # 








total # 


area 


total # 








(GHz) 


Fl 


F2 


F3 


Fl 


F2 


F3 


Fl 


F2 


Fl 


F2 


Fl 


F2 


Fl 


F2 


P06 mask 


mask I 


mask II 




45 


45 


22 


48 


610 


87 


382 


45 


21 


610 


72 


45 


22 


607 


88 


64 


- 


- 




75 


- 


370 


- 


1775 


827 


- 


- 


366 


1775 


778 


- 


37 


1759 


832 


300 


- 


- 




105 


- 


- 


- 


3027 


- 


4876 


- 


- 


3026 


- 


- 


- 


3042 


- 


400 


- 


- 




135 


3160 


1872 


3918 


- 


2313 


- 


3161 


1886 


- 


2322 


3124 


1871 


- 


2315 


550 


- 


- 




165 


1092 


- 


- 


- 


- 


- 


1091 


- 


- 


- 


1146 





- 


- 


750 


- 


- 


Number of 


195 






























1150 


- 


- 


detectors 


225 






























1800 


- 


- 




255 


- 


- 


- 


- 


2081 


- 


- 


- 


- 


2141 


- 


- 


- 


2073 


575 


- 


- 




285 




4623 


- 


- 








4669 


- 


- 


- 


4610 


- 


- 


375 


- 






315 






























100 


- 






375 


3281 


3186 


2859 


717 


820 


870 


3281 


3156 


717 


816 


3294 


3188 


719 


820 


64 


- 




Total area 




0.023 


0.023 


0.023 


0.081 


0.032 


0.057 


0.023 


0.023 


0.081 


0.031 


0.023 


0.023 


0.080 


0.032 


0.023 


- 




^ [number of dcteetors] 




7579 


10073 


6824 


6128 


6128 


6128 


7577 


10099 


6128 


6128 


7608 


10062 


6128 


6128 


6128 








45 


0.085 


0.042 


0.091 


0.34 


0.12 


0.30 


0.085 


0.040 


0.34 


0.10 


0.085 


0.042 


0.36 


0.12 


0.12 


- 


- 




75 


- 


0.25 


- 


0.35 


0.42 


- 


- 


0.25 


0.35 


0.41 


- 


0.25 


0.35 


0.42 


0.21 


- 


- 




105 


- 


- 


- 


0.31 


- 


0.69 


- 


- 


0.31 


- 


- 


- 


0.31 


- 


0.14 


- 


- 




135 


0.67 


0.40 


0.83 


- 


0.36 


- 


0.67 


0.40 


- 


0.37 


0.66 


0.40 


- 


0.36 


0.12 


- 


- 


Fractional 


165 


0.15 


- 


- 


- 


- 


- 


0.15 


- 


- 


- 


0.16 


- 


- 


- 


0.11 


- 


- 


area 


195 






























0.12 


- 


- 




225 






























0.14 


- 


- 




255 


- 


- 


- 


- 


0.090 


- 


- 


- 


- 


0.097 


- 


- 


- 


0.090 


0.034 


- 


- 




285 


- 


0.22 


- 


- 


- 


- 


- 


0.22 


- 


- 


- 


0.22 


- 


- 


0.018 


- 


- 




315 






























0.0039 


- 


- 




375 


0.090 


0.088 


0.079 


0.006 


0.016 


0.010 


0.090 


0.087 


0.0057 


0.017 


0.090 


0.088 


0.0057 


0.016 


0.0018 


- 


- 




45 


0.37 


0.52 


0.35 


0.099 


0.27 


0.13 


0.37 


0.53 


0.099 


0.30 


0.37 


0.52 


0.099 


0.26 


0.31 


- 


- 




75 


- 


0.13 


- 


0.058 


0.085 


- 


- 


0.13 


0.058 


0.088 


- 


0.13 


0.058 


0.085 


0.14 


- 


- 




105 


- 


- 


- 


0.044 


- 


0.035 


- 


- 


0.044 


- 


- 


- 


0.044 


- 


0.12 


- 


- 




135 


0.044 


0.057 


0.039 


- 


0.051 


- 


0.044 


0.056 


- 


0.051 


0.044 


0.036 


- 


0.051 


0.10 


- 


- 


Noise 


165 


0.074 


- 


- 


- 


- 


- 


0.074 


- 


- 


- 


0.072 


- 


- 


- 


0.089 


- 


- 


per 


195 






























0.072 


- 


- 


channel 


225 


































- 


[f^^ antenna] 


255 










0.054 










0.053 










0.10 








285 


; 


0.036 








; 


; 


0.036 


_ 




_ 


0.036 


_ 


_ 


0.13 








315 






























0.24 


- 






375 


0.043 


0.043 


0.046 


0.091 


0.085 


0.083 


0.043 


0.044 


0.091 


0.086 


0.043 


0.043 


0.091 


0.085 


0.31 




- 


Sf3d [10-3] 




0.96 


0.12 




0.95 


0.16 


- 


0.83 


0.074 


0.82 


0.10 


1.47 


0.19 


1.48 


0.25 


0.28 


0.18 


0.45 


(5 /3s [10-3] 




30 


2.9 




4.3 


2.2 




26 


1.9 


3.7 


1.4 


38 


3.9 


5.6 


2.9 


3.4 


2.2 


4.5 


5/3^x5/3^ 




-0.92 


-0.44 




-0.92 


-0.57 




-0.96 


-0.46 


-0.96 


-0.58 


-0.91 


-0.44 


-0.91 


-0.57 


-0.67 


-0.70 


-0.67 


Fl [l0~3j 




0.22 


0.26 




0.21 


0.24 




0.20 


0.23 


0.19 


0.21 


0.31 


0.37 


0.29 


0.34 


0.28 


0.25 


0.40 


F2 [l0~3] 




0.95 


0.0097 




0.16 


0.011 




1.1 


0.0057 


0.18 


0.0065 


0.79 


0.086 


0.14 


0.0094 


0.028 


0.018 


0.025 


F3 [nK^^b] 




5.4 


10 


5.3 


3.6 


7.4 


3.4 


5.4 


10 


3.6 


7.7 


5.4 


10 


3.6 


7.4 


14 


14 


14 



TABLE IV: Summary of the optimization results in the case of COrE considering channels only below 400 GHz. For each of 
the three masks, we present results for each of the three FOMs optimized under one of the two constraints, either fixing the 
focal plane area or the total number of detectors. The results for F0M#3 are quoted only once as they do not depend on the 
choice of the mask.The rightmost columns show the results computed using the original version of COrE as proposed in [3]. In 
the latter case the configuration is always the same, whatever the choice of the mask. 

the distribution depend on a type of the constraint. As 
the high frequency detectors have smaller area we find 
that the dust is better estimated {SPdust lower) under 
the total area constraint case as more high frequency de- 
tectors can be had. The opposite can be seen for the 
synchrotron estimation. The resulting levels of the resid- 
uals are however essentially identical in both these cases. 
More aggressive masking clearly helps, Mask I, but a bal- 
ance has to be maintained between lowering the overall 
foreground level and the precision of the spectral index 
determination. The latter, unlike the former, benefits 
from a larger number of pixels and higher foregrounds 
and, otherwise, can therefore start driving the eflFective 
residual up, e.g.. Mask II. 

The FOM#2-optimized configurations usually render 



good values for F0M#1 (within 10 - 15% of the best 
achievable values) , but result in the CMB map noise lev- 
els (F0M#3) up to twice higher than the best ones. The 
original versions of the considered experiments also yield 
the values of re// close to the best ones. 



3. F0M#3 optimization 

For this FOM, and in every considered case, the opti- 
mization of the focal plane with respect to the noise in 
the CMB map ends up with only three nonzero channels: 
two at frequencies as extreme as only allowed for, and one 
at an intermediate one contained in the CMB frequency 
band. The precise position of the latter is found again 
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channels 






P06 


mask 








ma 


sk I 






mas 


k II 




proposed vers 


ion 


Constraint 






area 






tot # 




area 


tot # 


area 


tot # 










(GHz) 


Fl 


F2 


F3 


Fl 


F2 


F3 


Fl 


F2 


Fl 


F2 


Fl 


F2 


Fl 


F2 


P06 mask 


mask I 


mask II 




30 


35 


62 


52 


472 


185 


601 


33 


61 


448 


168 


56 


62 


672 


187 


84 


- 


- 




45 


- 


491 


- 


- 


1016 


- 


- 


493 


- 


975 


10 


491 


1240 


1021 


364 


- 


- 


Number of 


70 


- 


- 


- 


4861 


- 


7646 


- 


- 


4935 


- 


- 


- 


3643 


- 


1332 


- 


- 


detectors 


100 


1970 


4056 


- 


2776 


3546 


- 


1400 


4101 


2579 


3567 


6311 


4049 


2583 


3544 


2196 


- 


- 




150 


13159 




16995 








14639 


- 


- 


- 


3518 


- 


- 


- 


3048 


- 






220 


823 


8328 






3164 




- 


8228 


- 


3207 


178 


8340 


- 


3157 


1296 


- 






340 


10364 


4525 


13210 


954 


1154 


817 


11586 


4259 


1102 


1148 


7988 


4566 


926 


1154 


744 


- 




Total area 




0.084 


0.084 


0.084 


0.16 


0.10 


0.20 


0.084 


0.084 


0.16 


0.099 


0.084 


0.084 


0.21 


0.10 


0.084 


- 




[number of detectors] 




26352 


17462 


30258 


9064 


9064 


9064 


27658 


17143 


9064 


9064 


18061 


17508 


9064 


9064 


9064 








30 


0.042 


0.074 


0.063 


0.29 


0.18 


0.30 


0.040 


0.073 


0.28 


0.17 


0.068 


0.074 


0.32 


0.18 


0.10 


- 


- 




45 


- 


0.26 


- 


- 


0.44 


- 


- 


0.26 


- 


0.44 


0.0054 


0.26 


0.26 


0.44 


0.19 


- 


- 


Fractional 


70 






- 


0.55 


- 


0.70 


- 


- 


0.57 


- 


- 


- 


0.31 


- 


0.29 


- 




area 


100 


0.21 


0.44 


- 


0.15 


0.31 


- 


0.15 


0.44 


0.15 


0.32 


0.68 


0.44 


0.11 


0.31 


0.24 


- 


- 




150 


0.63 




0.81 


- 


- 




0.70 


- 


- 


- 


0.17 


- 


- 


- 


0.15 


- 


- 




220 


0.018 


0.19 


- 


- 


0.057 


- 


- 


0.18 


- 


0.060 


0.0040 


0.19 


- 


0.057 


0.029 


- 


- 




340 


0.097 


0.042 


0.12 


0.0046 


0.0088 


0.0032 


0.11 


0.040 


0.0054 


0.0090 


0.074 


0.043 


0.0034 


0.0087 


0.0069 


- 


- 




30 


0.41 


0.31 


0.34 


0.11 


0.18 


0.010 


0.42 


0.31 


0.12 


0.19 


0.33 


0.31 


0.094 


0.18 


0.27 


- 


- 




45 


- 


0.11 


- 


- 


0.077 


- 


- 


0.11 


- 


0.078 


0.77 


0.11 


0.070 


0.077 


0.13 


- 


- 


Noise 


70 


- 


- 


- 


0.035 


- 


0.028 


















0.067 


- 


- 


per 


100 


0.055 


0.038 




0.046 


0.041 




0.065 


0.038 


0.048 


0.040 


0.031 


0.038 


0.048 


0.041 








channel 


150 


0.021 




0.019 








0.020 








0.041 








0.044 








220 


0.085 


0.027 






0.044 






0.027 




0.043 


0.18 


0.027 




0.044 


0.068 








340 


0.024 


0.036 


0.021 


0.079 


0.072 


0.086 


0.023 


0.038 


0.074 


0.072 


0.027 


0.036 


0.080 


0.072 


0.090 




_ 






0.25 


0.086 




0.71 


0.13 


. 


0.37 


0.055 


0.62 


0.055 


0.41 


0.14 


0.66 


0.21 


0.16 


0.10 


0.25 






2.39 


0.51 




1.5 


0.38 




3.2 


0.33 


1.4 


0.33 


2.7 


0.68 


0.67 


0.50 


0.55 


0.36 


0.73 


513^X5^3 




-0.66 


-0.46 




-0.96 


-0.48 




-0.10 


-0.48 


-0.88 


-0.49 


-0.88 


-0.46 


-0.54 


-0.48 


-0.63 


-0.65 


-0.62 


Fl [10~2] 




0.19 


0.20 




0.19 


0.20 




0.17 


0.18 


0.17 


0.18 


0.27 


0.28 


0.27 


0.29 


0.20 


0.18 


0.29 


F2 [10"-^] 




0.024 


0.0018 




0.059 


0.0023 




0.076 


0.0011 


0.069 


0.0014 


0.020 


0.0016 


0.012 


0.0020 


0.0041 


0.0026 


0.0036 


F3 [nK^^b] 




1.5 


2.7 


1.4 


1.6 


3.1 


1.5 


1.4 


2.7 


1.6 


3.2 


1.6 


2.7 


1.7 


3.1 


3.0 


3.0 


3.0 



TABLE V: As in Table IV but for CMBpol [4]. 



to be dependent on a type of the hardware constraint 
used. For the CMBpol satellite the values of the central 
frequencies are 70 or 150 GHz for the constraint on the 
total number of detectors and the area, respectively. For 
COrE they are 105 and 135 GHz, respectively. We re- 
call that in the case of this FOM all the spectral indices 
are assumed to be known, otherwise the three channel 
configurations derived here would be singular and would 
not permit a determination of the spectral indices. The 
achieved noise levels are better when the total number of 
detectors is constrained, and are lower by a factor up to 
1.6. The original versions of the satellites result in quite 
high noise (higher by a factor of 2.5 — 4) in comparison 
with the one derived for the optimized configurations. 



4- Consensus configuration 

Having postulated three different FOMs we have ob- 
tained three different, optimized configurations. More- 
over, as we have already mentioned, there is clearly ten- 
sion between some of the considered FOMs. The issue 
now is therefore how to find a compromise between them 
in order to select a single configuration as a result of 
our procedure. To do so we first recall that in our case 
the configurations preferred from the point of view of 
F0M#1 fail to ensure a satisfactory level of the residu- 
als, as quantified by F0M#2, while optimization of the 



latter yields a rather high level of noise, i.e., F0M#3. 
Simultaneously however optimizing F0M#1 effectively 
ensures a near optimization of F0M#3. Therefore we 
will retain the former as part of the optimization and 
drop the latter, which from now on will be used only as 
a benchmark to compare against the obtained configu- 
rations. As F0M#1 on its own is not fully satisfactory 
we will therefore optimize it, while imposing a constraint 
based on a value of F0M#2. Clearly if more FOMs are 
used more constraints can be introduced in the same way. 
What values to choose for the thresholds is a somewhat 
debatable question, an answer to which will depend on 
a specific application. In our case, we first note that for 
the FOM#2-optimized configuration the resulting re/ / is 
an order of magnitude lower than the respective value of 
rmin- The latter is moreover typically 20% higher than 
its corresponding best value. 

From the viewpoint of these two indicators the 
FOM#2-optimized solution looks therefore quite satis- 
factory. This is particularly true for the CMBpol case 
for which this solution can be accepted as indeed the fi- 
nal outcome of the procedure. For COrE the potential 
remaining problem could be the noise level. In search of 
the consensus configuration we may therefore want to let 
the residual grow, in particular, relatively to the value of 
rmin and gain in terms of the noise. Clearly the more 
we compromise on rg// the more we can gain on ctcmB' 
As for COrE the values of rmin are close to 2 x 10~^ 
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and we will allow Vmin to be as large as 10~^, and re- 
optimize the problem with respect to F0M#1 with the 
constraint that r^ff < 10~^. This specific choice is in fact 
arguably rather high. In fact we find that imposing more 
strict limits of Veff < 2.5 x 10~^ or 5 x 10~^ already 
can ensure satisfactory noise levels, 4.0 and 3.9 uKcmb, 
respectively, and thus could be preferred for the actual 
experiment optimization. We will however use hereafter 
the threshold of 10~^ as it is more useful for demonstra- 
tion purposes. 

The resulting configuration is shown in Fig. 10 
and summarized in Table VI, where we show the 
results obtained for the two hardware constraints. The 
spectra of the noise and residuals are also displayed 
in the right panel of the Figure. We conclude that 
the detector distribution indeed resembles a hybrid 
between two solutions obtained earlier as a result of 
the optimization of FOMs: #1 and #2 separately with 
a respective hardware constraint. Figs. 6 and 7. As 
anticipated above the overall level of the foreground 
residual spectrum is rather high as compared to both 
the B-mode spectrum and its respective variance due to 
the noise and the sky. However, as intended, the noise 
level has been successfully suppressed to the levels close 
to those computed for F0M#3 optimized configurations. 



5. Post-processing 

For definiteness in this Section we focus on a sin- 
gle, specific configuration, and choose for it the opti- 
mized COrE setup obtained from the optimization of 
the F0M#1 value, while constraining the corresponding 
value of F0M#2 to be no more than 10~^ and keeping 
the total number of detectors fixed, as discussed at the 
end of the previous Section. The details of this configura- 
tion are listed in the fourth column of Table VI together 
with the respective FOMs values. 

The procedure employed in this Section follows the 
steps outlined in Sec. II D. In Fig. 11 we show an im- 
pact of a fractional change of a number of detectors in 
one channel at the time on the values of the FOMs. The 
latter are given relative to their optimized values and 
therefore all the curves shown in the figure are expected 
to start from the unity for the fractional change equal 
to zero, as the latter corresponds to the optimized con- 
figuration, and then grow typically monotonically with 
an increasing value of the fractional change. In addition, 
for reference we also show how the FOMs values would 
change if numbers of detectors in all the channels are 
decreased by the same fraction. We note that at least 
for the two of the FOMs, i.e., F0M#2 and #3, the lat- 
ter dependence can be straightforwardly predicted using 
Eqs. (5), (12), and (17) and shown to be inversely pro- 
portional to an actual number of detectors in the corre- 
sponding configurations and thus inversely proportional 
to (1— fractional change of detectors). This indeed is ad- 



hered to by our numerical results. 

The most striking features of some of the results are 
their apparent flatness extending on occasions to a rather 
high values of the fractional change. At face value that 
suggests that one is at liberty to change a number of 
detectors in some of the channels rather drastically but 
without noticeably penalizing the performance of the in- 
strument. However, though some freedom indeed exists, 
it has to be exploited carefully. In particular, signifi- 
cantly changing a number of detectors in one selected 
channel, will usually have an eff'ect of removing any free- 
dom in adjusting the number of detectors in the remain- 
ing channels. Therefore if one's goal is to round-up 
the optimization results in a way to make them more 
amenable to an actual implementation that may not be 
the right way to go. Below we showcase some of these 
issues in the speciflc case at hand. 

Probably most conspicuous thing about the configura- 
tion considered here is the presence of a channel centered 
at 255 GHz, to which are assigned only 17 detectors, as 
opposed to a few thousands in some of the other channels. 
A natural question to ask is therefore whether this chan- 
nel is needed at all. In fact, the two outermost panels of 
Fig. 11 seem to confirm our feeling that this channel is in 
practice irrelevant as both the FOMs #1 and #3 effec- 
tively do not depend on its being present. This is not so 
however for the F0M#2 as shown in the middle panel. In 
this case removing this channel altogether will boost the 
value of this FOM, and thus the level of the foreground 
residual by a factor of ^ 1.5. Though not overwhelmingly 
large it is substantial enough to justify holding on to this 
channel (unless of course the hardware cost of having the 
extra channel tips the balance the other way) . These ex- 
pectations are confirmed by direct calculations, results of 
which as shown a 5th column in Table VI. (We note that 
an attempt to re-optimize the resulting 4-channel system 
a posteriori does not bring much improvement either; see 
Table VI, column 6). We note that trying to keep the 
level of residuals down in this case can be of particular 
importance given that already in its original, optimized 
version (Table VI) the resulting values of Vmin ^tnd rg// 
are close enough to each other that this is probably the 
latter, i.e, the level of residuals, which would drive the 
actual limit on a detectable r value for this setup, rather 
than the statistical estimate provided by F0M#1. Let- 
ting Vef f grow any further would therefore directly affect 
our science goals. Instead we can therefore try to trim 
a number of detectors in either 45 or 375 GHz channel. 
We see that we can potentially reject up to ^ 70% of the 
detectors in the former or ~ 80% in the latter, without 
affecting the residuals level (F0M#2) in any appreciable 
manner. This would have an effect of increasing F0M#1 
value by no more than ~ 5% and F0M#3 by no more 
than ~ 50%, both of which may therefore look perfectly 
acceptable. Whichever option we opt for, we can then 
reuse the spare detectors by distributing them to some 
of the existing channels or creating some additional ones, 
say at 165 GHz, in order to be better equipped to face 
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FIG. 10: Left panel: Results of the FOM#l-based optimization derived in the case of the COrE experiment with a constraint 
on F0M#2 (< 10~^), and using the P06 mask and channels with frequencies below 400 GHz. Right panel: Comparison of 
the power spectra corresponding to the proposed and optimized versions of the COrE experiment as listed in Table VI and 
visualized in the left panel. The spectra in blue (mid-level noise spectrum and highest residuals, these latter being depicted 
with dashed lines) correspond to the cases with the total area constraint. On the other hand, the spectra in magenta (lowest 
noise level, same residuals as previously) correspond to the cases with the detector number constraint. The foreground residual 
spectra in both of these cases overlap perfectly in the figure with the magenta curve being invisible. 
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TABLE VI: Comparison of performance of the variants of the COrE setups considered in Sec. IV D 5. All the optimization 
runs have been performed while keeping the total # of detectors constant, used the P06 mask and only the channels below 
400GHz. The configurations in the Table include, from left to right, (1) a result of the optimization procedure with respect to 
F0M#1 with a constraint on F0M#2 of < 10~^, (2) the same configuration but with the 255GHz channel suppressed, (3) a 
configuration with the same frequency channels as in (2), but with numbers of detectors re-derived via an optimization with 
respect to F0M#1 and a constraint F0M#2 < 1.5 x 10~^, and (4) a re-optimized configuration with the channels as before 
plus two extra ones with a fixed number of detectors (= 200 each). The last column shows the original COrE configuration for 
comparison. Numbers in bold correspond to parameters forced to be at a given value. 



some potential surprises (Sect, II D). However a special 
care then has to be taken if a number of detectors in 
some other channels needs to be concurrently decreased. 
This is because, as illustrated by lines marked with cir- 
cles in Fig. 11, not all directions in the parameter space 
are similarly flat. 

If our aim is to just round-up the detector numbers we 



can proceed as outlined in Sec. II D. We first postulate 
a set of fractional changes from the optimized values. In 
our case these could be [vk] = [1.025, 1.05., 1.1, 1.15] for 
F0M#1 and [vk] = [1.05, 1.25., 1.5, 2.0] otherwise, and 
then use Fig. 11 to read off the corresponding values of 
the fractional change for each channel and each FOM. 
These are values denoted a in Sec. II D. In our case for 
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FIG. 11: Dependence of the values of F0M#1 (left), F0M#2 (middle), and F0M#3 (right), on a fractional change of a number 
of detectors in the hardware configuration as detailed in the fourth column of Table VI. The solid lines show cases with a 
number of detectors in only one selected channel being gradually decreased (left to right) and all the others being kept fixed at 
their optimized values. The circles show the case with a number of detectors in all channels decreasing by the same fraction 
simultaneously. The color schemes for the lines are the same in all the panels and described in the legend. 




threshold / optimized FOM 



FIG. 12: The worst values of each FOM, t;, computed for 
each of the concentric hyperellipsoids, Eq. 20, defined by the 
threshold values, v, as shown on the horizontal axis. The 
dotted line shows v = v case. Clearly, ~ in all shown 
cases, where the latter approximate equality holds to within 
10%. The values of v and v given here are relative to the 
optimized values of the respective FOMs. 



F0M#1 they read 



409 496 555 577 

1017 1664 1771 oo 

880 1477 2236 2697 

oo oo oo oo 

442 549 624 654 



(26) 



where k-th column corresponds to the k-th value of 
and thus gives values of a for each of the five channels 
with nonzero number of detectors in the optimized con- 
figuration (see second column of Table VI). We can use 



these values to define, Eq. (20), hyperellipsoidal volumes, 
V/c, in the parameter space centered on the optimized 
configuration. We note that the infinity sign marks the 
cases, where the desired value of Vk could not have been 
reached due to the parameter space boundary. For in- 
stance, the values in the fourth row of Eq. (26) are all 
infinite as in the neighborhood of the optimized configu- 
ration the value of FOM^^l does not depend on a number 
of detectors in this channel as can be seen in Fig. 11. 

To find the worst case value of the FOM for a k-th hy- 
perellipsoid, v^^ we use random sampling of first an entire 
volume of the ellipsoid followed by that of only its sur- 
face. The latter requires fewer samples to ensure proper 
sampling density and is more efficient if we have some ex- 
pectation of the FOM values monotonically deteriorating 
away from the optimized configuration. As anticipated 
in Sec. II D the corrected values, v^, and initial ones, 
are indeed found to be quite close, typically within 20% 
of each other as illustrated in Fig. 12. 

The series of the concentric hyperellipsoids constructed 
here gives us a quick, though approximate, way to esti- 
mate the performance of some proposed configurations 
derived from the optimized one via small changes of all 
or some optimization parameters. As an example, con- 
sider a configuration with [dj] = [600, 1700, 3000, 17, 700] 
detectors in each of the five channels considered here. 
Given that for F0M#1, 



k 



Sky 



< 1 



(27) 



is fulfilled for any /c, we conclude that the respective value 
of F0M#1 for this case will not be larger than by a factor 
1.025 than the optimized value. Indeed a direct 
calculation renders a value 1.002 times higher than the 
optimized one in agreement with our quick estimation. 
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Similarly, we can deduce the performance of this config- 
uration as expressed by the two other FOMs. These are 
more sensitive at least to changes in some of the channels 
however we find that for this specific configuration we can 
lose no more than a factor of 1.05 for both of them. These 
could be compared to the actual values of 1.01 and 1.02, 
respectively, all relative to the corresponding optimized 
values. 

In this case overall the loss of performance seems rather 
benign and acceptable. Moreover, as a result of rounding- 
down the detector numbers we have gained around 100 
of those, which we can arbitrarily assign to any of the 
existing channels or even create a new one to saturate 
the constraint on the total number of detectors. What- 
ever decision we make we will not compromise any of the 
performance figures derived earlier. 

To illustrate a process of adding some ad hoc channels 
at this time we start from a configuration more drasti- 
cally stripped-down than the one discussed above. Let 
that be for instance [4] = [500, 1500, 3000, 0, 600], where 
we not only reduced numbers of detectors per channel 
more substantially but also removed the fourth channel 
altogether. Using the hyperellipsoid formalism we get 
quickly a helpful insight into how much we have lost as 
a result of choosing this configuration. As we already 
discussed, the biggest loss is found with regard to the 
value of F0M#2, which is boosted by more than 50% 
(but less than 100%) with F0M#1 and F0M#3 chang- 
ing by ^ 1.05 and ~ 1.1 respectively. (The actual values 
being 1.01,1.81 and 1.09 for FOMs: #1, #2, and #3.) 
However we have also gained as many as 400 detectors, 
which can be distributed at our discretion to fill the con- 
straint. Let us do so by introducing two extra channels 
at 195 and 285 GHz with 200 detectors each. This im- 
proves the performance of the considered configuration, 
an improvement which we can ameliorate even further by 
performing the optimization with respect to the detector 
numbers in the four original channels and keeping the 
detector numbers of the new channels fixed to 200. We 
indeed find that the new setup performs nearly as well as 
the initial optimized one (Table VI, column 3 vs 7) but 
possesses a more uniform frequency coverage. If we now 
want to perform a controlled detector number rounding 
and analyze its impact on the configuration performance 
we would need to restart the entire procedure described 
above. 



6. Robustness tests 

As explained in Sec. HE, for each FOM, we start from 
the optimized configurations, as determined earlier and 
check how the values of the FOMs depend on a random 
suppression of a number of detectors in each channel by 
some fraction. Specifically, we assume here that the dis- 
tribution of the anticipated detector failures is Gaussian 
with the dispersion equal to e of which is the same for 
each of the considered channel and taken to change from 
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FIG. 13: Summary of our robustness tests applied to the 
COrE configuration obtained via the optimization of F0M#1 
with constraints of F0M#2 < 10~^ and a fixed number of de- 
tectors. The lines of different colors correspond to different 
FOMs and different lines show: average (dotted), 95% confi- 
dence limit, (dot-dashed), and the worst value (solid). 



5%, 10%, 25%, and 50 %. We randomly draw some 
large number of samples, here 10^, and histogram the re- 
sults for each of the FOMs. We then compute the most 
likely value of the FOMs, 95%-confidence limit, and the 
worst drawn value. In the case of the COrE configura- 
tion studied in the previous Section we collect the results 
in Fig. 13. We conclude, as probably could have been 
anticipated from the results of the previous Section, that 
for a failure rate as large as 30% we will not compromise 
on the FOM values by more than 50% with respect to 
the optimized ones, while a failure rate of 10% will result 
in their 10% increase. These result affirm the practical 
soundness of the derived configuration. 



7. Robustness with respect to the foreground modeling 

Results of the optimization procedures including thus 
the procedure considered here are usually only as good as 
the foreground models used in their course. In the spe- 
cific case studied here we expect that our results are fairly 
robust as far as foreground morphology is concerned. Our 
estimates are driven by two compact description of those, 
the foreground correlation matrix, F, and the foreground 
power spectra, which are not expected to be wildly differ- 
ent than what we have assumed here. We note in particu- 
lar that an increasing amplitude of the foregrounds lead- 
ing to an increase of both the elements of the matrix, F, 
and overall normalization of the foreground power spec- 
tra would decrease the errors on the spectral parameters 
and result in the amplitude of the residuals being virtu- 
ally unchanged. These expectations are confirmed by the 
results obtained here for the three different masks. 

It is more difficult to assess, though potentially more 
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crucial, the impact of increasing a number of spectral pa- 
rameters. This could be either due to more complicated 
spectral dependences of true foreground components, or 
as a result of a spatial dependence of spectral parameters. 
The former problem is inherent to all parametric compo- 
nent separation approaches including the one assumed 
here. In general, a wrong parametrization or frequency 
scaling laws assumed in such approaches may invalidate 
separation results. In practice, the effects are more sub- 
tle but arising biases can affect an interpretation of the 
results. It is therefore important that the scaling laws 
assumed in the optimization continue to be improved, 
reflecting any relevant, new observational data and more 
detailed, theoretical models of the foreground physics as 
they become avail- able. In a case of some doubts, a 
rather conservative approach can be fruitful, restricting 
channel frequencies to a range for which the scaling laws 
are known to provide at least good approximations to the 
actual ones. This is in fact an approach we used in this 
work by selecting a parametric model for the dust signal 
with a single parameter and reduced the frequency range 
to those lower than 400 GHz. 

A spatial dependence of the scaling parameters can be 
treated more directly. We will implement that by divid- 
ing the observed sky into a multiple disjoint regions and 
introduce one set of parameters for each of those. To 
abstract from details of the regions shape and position, 
we assume that they are defined in such a way that the 
errors on spectral parameters are the same for each of the 
regions, i.e., that the differences of the overall magnitude 
of the matrix F are compensated by a respective number 
of pixels in each area. In general this assumption would 
imply that more, though smaller by area, regions are de- 
fined in high-contrast foreground sky areas. This indeed 
could well be the case as the high-contrast foreground re- 
gions are expected to be more complex and may require 
more parameters to ensure sufficient accuracy. 

For demonstration purposes we assume that we have 
10 regions with the corresponding errors on spectral pa- 
rameters being \/T0 times larger than in the single region 
case as studied before. We note that cutting the sky into 
regions will unavoidably affect the foregrounds and thus 
residual power spectrum on scales larger than a typical 
size of the region. We will ignore this effect here, mo- 
tivated by the fact that our earlier results did not find 
any strong dependence on the shape of the power spec- 
trum. We also neglect here all practical difficulties such 
as matching the results on the map level coming from the 
different regions and which will have to be addressed in 
any actual application of the discussed method. We limit 
here ourselves to the COrE-like configuration as defined 
earlier, calculate the FOMs as before, and optimize the 
configuration following the steps outlined before. As ex- 
pected we find that the optimal configurations this time 
are not very different from the ones obtained earlier. This 
is because F0M#1 and F0M#3 are mostly trying to op- 
timize the overall noise level, which is the same now as 
before, and though the value of F0M#2 increased by a 



factor 10 due to increase of the spectral index errors this 
is the same configuration, which ensures its minimum. 
As a consequence the new value of r^ff is now higher 
than that of Vmin • This clearly does not invalidate results 
of the optimization procedure as such, however care has 
to be exercised, while interpreting the obtained values of 
Tmin^ which may not be taken directly as the performance 
forecasts for the setup as far as detecting r is concerned. 



E. Varying the number of channels and their 
frequencies 

We present here some results based on an implemen- 
tation of the scheme proposed in Sec. II C. We start 
from ^ 70 channels evenly spaced between 45 and 375 
GHz every 5 GHz, with ~ 6000 detectors (total num- 
ber of the COrE proposed version) equally distributed 
among those, as shown in Fig. 14. Then we perform 
the optimization with respect to F0M#1, while keeping 
F0M#2 < 10~^ and the total number of detectors fixed. 
As a result we obtain a highly clustered distribution of 
detectors in between the initial channels, with many of 
these being empty. We therefore combine detectors of 
neighboring channels and replace them by a new chan- 
nel with the central frequency set as a weighted, by a 
number of detectors, mean of the optimized distribution. 
The new channels are defined to ensure proper spacing 
between them. Once the new channels are determined we 
perform a second round of the optimization, this time in- 
voking only the new channels and aiming at optimization 
of the detector distribution between them. The result is 
shown in the left bottom panel of Fig. 14. We note that 
the procedure not only improved the values of the FOMs 
with respect to the starting (original) configuration, i.e., 
F0M#1 has been decreased by - 17% (- 25%), while 
the noise by a factor ~ 4 (~ 3), but also, and arguably 
most importantly, it resulted in a configuration signifi- 
cantly simpler than the initial one with the number of 
channels reduced from 70 down to 9. 

We note that maybe somewhat surprisingly both the 
configurations derived here, the final one as well as the in- 
termediate one obtained after the first optimization step, 
show only a minor, ~ few percent, gain over the five- 
channel configuration we have considered earlier; see, 
e.g., the first column of Table VI. This is due to our 
setting the threshold for F0M#2 rather high, while the 
main advantage of the significantly larger set of the initial 
channels used here is that it permits finding in principle 
more satisfactory compromises between the three FOMs, 
characterized by values of F0M#2 lower than what could 
be achieved with more modest setups discussed earlier. 



V. CONCLUSIONS 

In this work we have proposed a general scheme for a 
performance optimization and forecasting of the CMB B- 
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FIG. 14: Demonstration of the optimization results derived with respect to a variable number of channels, numbers of detectors 
per channel, and their central frequencies, while constraining the total number of detectors (= 6128 as in the proposed 
COrE version). Upper left panel shows, from top to bottom, (1) the starting configuration with all the detectors evenly 
distributed among a fine-grid of channels; (2) a configuration after the first optimization of F0M#1 constrained to ensure that 
F0M#2 < 10~^; and (3) the reoptimization of configuration (2) restricted only to channels with a number of detectors larger 
than five and after adjacent channels merging and recentering as described in Sec. IV E. Right panel shows power spectra 
corresponding to these configurations contrasted against the expected CMB signals. 



mode experiments in the presence of astrophysical fore- 
grounds. Our approach is based on a maximum like- 
lihood parametric technique for component separation, 
for which we have derived Fisher-like error estimates 
for spectral parameters. We use the latter to calcu- 
late the residual level of the foregrounds in cleaned CMB 
maps given assumed, instrument characteristics and fore- 
ground model. We then optimize the former by minimiz- 
ing a set of proposed figure of merit indicators, which 
reflect our science goals. Subsequently we have applied 
this approach to two specific cases of recently proposed 
CMB B-mode satellites: American CMBpol [4] and Eu- 
ropean COrE [3]. We have discussed in detail the choices 
and trade-offs inevitable in such an optimization process. 
We have demonstrated how such a procedure can help to 
simplify the resulting hardware design, while ensuring 
the same (or nearly the same) science outcome. 

We emphasize that results of such a procedure can be 
only as reliable as the foreground models that are ap- 
plied. This underlines the importance of developing bet- 
ter understanding of the polarized foregrounds, in partic- 
ular, and characteristically of the parametric methods, as 
far as the functional form of the foreground component 
scaling laws is concerned. However, our approach is ex- 
pected to be relatively robust as far as other details of the 
foreground signals are concerned, such as, spatial distri- 



bution or spatial variability of the spectral parameters, 
with the latter playing a major role in determining the 
scientific reach of the experiment but not affecting its 
configuration. 

We also note here that the presented framework could 
be extended to work with any component separation 
method, which implements the separation by first esti- 
mating the mixing matrix, in a parametric or nonpara- 
metric way, and which is capable of producing estimates 
for the errors of the spectral parameters for any hardware 
configuration. One could, and ideally would, therefore 
use the formalism proposed here to define configurations, 
which would ensure that many of the available compo- 
nent separation methods perform well. Though the com- 
ponent separation methods usually conform with the first 
requirement, the second is more demanding and typically 
can be done only via computationally-heavy Monte Carlo 
simulations. Those may be often impractical for the op- 
timization purposes, making an implementation of such 
a program difficult. A related, but simpler to address, 
problem is whether the configurations optimized with one 
method will work for satisfactorily with the others. We 
will leave an investigation of both these issues to future 
work. 

As we point out in the introduction, FOMs required 
for the optimization procedure are also suitable for the 
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performance forecasting. This also clearly applies to 
the FOMs proposed here and in particular F0M#1 and 
F0M#2 seem relevant to the primordial signal detection 
producing values of rmin and rg// on order of O (lO""^) 
for the considered optimized configurations. However, 
given that each of these two FOMs reflects a somewhat 
different aspect of the problem - a statistical uncertainty 
in former cases versus a systematic one in the latter - care 
has to be taken while interpreting these values. Never- 
theless, as is, our results seem to support at least the 
contentions made elsewhere suggesting that r 10~^ is 
a realistic goal for the experiments considered here. 

Finally, we point out that the science goals we 
have posed for the considered CMB experiments are 
clearly more modest than those targeted by the original 
CMBpol and COrE designs. This is responsible, at 
least in part, for the more complex and advanced 
instrumental configurations as proposed in the original 



proposal. More diverse science goals can, and should, 
be studied in the presented framework. We leave this 
as well as considerations of other possible extensions, 
e.g., an inclusion of some of the instrumental effects, to 
future work. 
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APPENDIX A: SPECTRAL LIKELIHOOD DERIVATIVES. 



We present here some details of the derivation of Eqs. (4) and (5). 
First from Eq. (3) we have 



d\nC 
dp 



J2 (A,^Sp)' {dp-Asp) 



from which the second derivatives of the spectral likelihood follow as 

go go; = J2 | ( Sp + A,^ Sp,^0^ {dp - ASp) - {A^^ Spf N'^ {A^^r Sp ^ A Sp,^^) j 

And the noise ensemble average reads, 

P 

+ tr [A^^ N-^ {(dp -Asp) sl^^,)noisc 



tr 



tr [A^^N-^ A(sp,^.s^)noise] 



From Eqs. (1) and (2) we now have 

\ ^ P / noise 



Spsl + (A^N-^A) \ 

-Sp4 (A^^,N-^A + A^N-^A,;30 (A^N-^A)"' + Spq^(^,) 
(A^N-^A)"^ (A^^,N-^A) (A*N-^A)"^ 
(As - Asp) s^p 

- (As - Asp) 4 (A^^,N-^A + A^N-^A,;30 (A^N-^A)"^ 
+ (As - Asp) q^(^,) + A^p> (A^N-^A)"' 
+ A (A^N-^A)"^ (A*N-^A,^/) (A*N-^A)"\ 

where A and s are the true mixing matrix and sky components, respectively, s is a component estimate 
of noiseless experiment, i.e.. 



(Sp ^P.^') noise 

((dp A Sp) Sp^Yioise 
{{dp A Sp) ^p^^f) noise 



and q(^) is defined as. 



Sp = (A* N-^ A) ^ A* N-^ A Sp 



qp(^,) ^ (A^N-^A) 'A^^,N-iAsp. 



Hence, 



7 9^ \nC \ 
\ a/3 a/37. 



- ^ { (A,^^^Sp)' (Asp - Asp) + (A,^Sp)' (A,^. Sp) 
p 

+ tr [a^;3N-i (Asp - Asp) s^ (A^^,N-iA + A^N-iA,;30 (A^N-^A)"' 
(A,/3qp,(/3o)^ (Asp - Asp) 

A^^N-^A (A^N-^A)"^ (A^^,N-^A + A^N-^A,^/) Sps; 



tr 

tr 



+ tr [A^^N-iAqp(^,)sy}. 
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Moreover assuming now the true values of the spectral indices, i.e., P = $^ 



^ 1^ ^profile 

dp dp' 



+ tr 



tr 



P 

A^^N-^A (A*N-^A)"^ (A^^,N-^A + A*N-^A,^/) ^s^s^ 

p 

A]^ N-^ A (A* N-^ A)"^ A^^, N"^ A ^ 

p 

tr |[a^^N-^A (A^N-^A)"^ A^N-1A,;3' - A^^ A,^. ] ^ s^j , (AlO) 



from which Eq. (5) follows. 



APPENDIX B: FISHER MATRIX ALGEBRA. 

The Fisher matrix can be expressed as [24], 

where C is the covariance matrix and A is some param- 
eter. 

In our case, Aq, = A/3 = r, the tensor-to-scalar ratio, 
while the covariance matrix in a harmonic space, C, is 
given by. 



where, 



^ = ^33' = {^ImA'm')^ 



j = r + f + m, 

i = round[(-l + Vl + 4j)/2], 



(B2) 



(B3) 



and thus j goes from to {imax + 1)^ — 1- The function 
round rounds a real number to a closest integer. The 
Fisher matrix expression can be now specialized as. 



3,3' 



33' dr 



(B4) 



where j {j') is related to I as in Eqs. (B3). 

Because there are three uncorrelated contributions to 
the overall signal, which are CMB, noise, and foreground 
residuals, we can write. 



^33' 



CCMBx I r^noise ^ i £ 
I ^33' + ^33' + Jj Jj' 



where fj stands for a vector of coefficients ar- 
ranged according to tlie j index. 

To compute the matrix used in equation (Bl), we 
can use the Sherman-Morrison formula obtaining 



C"^ = - D-V(l + /^D-V)"V^D-S (B6) 



where (1 + /^D ^/) ^ is a number and, hence, 



C"^ =D 



" (l + /tD-V)' 



(B7) 



which, given that [D = (1/C^)^jj/, becomes 



ax /^A 

t'=0 



, (B8) 



where is a residuals power spectrum, Eq. (12), de- 
fined here as 



2^ + 



(B9) 



(B5) So now we have finally 
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[c-^v = % - / — — + — — (BIO) 




^3 / . , V- , . ^ ^'M ^2 1 + V (2f ' + 1) 



^ . 



which inserted into Eq. (B4) gives Eq. (15). 



